Block similarity in fuzzy tuples

نویسندگان

  • Carl Frélicot
  • Hoel Le Capitaine
چکیده

A common problem in decision-making is to analyze a tuple of numerical values associated with options, such as the degree of satisfaction assigned by experts to alternatives or probability values for hypotheses computed from data. With no loss of generality, it is assumed that the tuple contains values in the unit interval. For post-processing of typical value(s), singular values that may arise from noise in the data, or from unreliable experts, must not be taken into account. We present the concept of block similarity to address the problem of detecting subset(s) of typical values instead of extracting singular ones. The concept relies on suitable aggregation operators that combine the tuple components. Three different block similarity operators are proposed and discussed. These rely on Sugeno integrals of the tuple with respect to three different measures, namely cardinal weighting, symmetric kernel weighting and non-linear weighting. Numerical examples demonstrate their behaviors and their ability to detect blocks of similar values.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Constructing a fuzzy controller from data

Fuzzy control can be interpreted as an approximation technique for a control function based on typical, imprecisely speciied input{output tuples that are represented by fuzzy sets. The imprecision is characterized by similarity relations that are induced by transformations of the canonical distance function between real numbers. Taking this interpretation of fuzzy controllers into account, in o...

متن کامل

Attribute - oriented defuzzification of fuzzy database

We are investigating the ability to data mine fuzzy tuples, which are often utilized to represent uncertainty about the registered information. We discuss different aspects of fuzzy databases and comment on practical advantages of the model we utilized in our research. Motivated by a well known technique called Attribute-Oriented Induction, which has been developed for summarization of ordinary...

متن کامل

Searching for a compromise between satisfaction and diversity in database fuzzy querying

This paper deals with fuzzy queries and describes an approach that aims at providing users with a set of answers which satisfies a diversity criterion on one or several attributes. Different cases are considered and two types of algorithms are described. The first one, which has a linear complexity in terms of the number of tuples in the result, is suited to the case where the notion of similar...

متن کامل

Distributed Data Deduplication

Data deduplication refers to the process of identifying tuples in a relation that refer to the same real world entity. The complexity of the problem is inherently quadratic with respect to the number of tuples, since a similarity value must be computed for every pair of tuples. To avoid comparing tuple pairs that are obviously non-duplicates, blocking techniques are used to divide the tuples in...

متن کامل

Eliminating Fuzzy Duplicates in Data Warehouses

1 Work done while visiting Microsoft Research Abstract The duplicate elimination problem of detecting multiple tuples, which describe the same real world entity, is an important data cleaning problem. Previous domain independent solutions to this problem relied on standard textual similarity functions (e.g., edit distance, cosine metric) between multi-attribute tuples. However, such approaches ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Fuzzy Sets and Systems

دوره 220  شماره 

صفحات  -

تاریخ انتشار 2013